Image Holistic Scene Understanding Based on Image Intrinsic Characteristics and Conditional Random Fields
Abstract
We propose a holistic image scene understanding model based on intrinsic image characteristics and conditional random fields. The model integrates scene classification, semantic segmentation, and object detection. 1) For scene classification, we extract PHOW features and apply KPCA dimensionality reduction to obtain a feature representation of each image. 2) For object detection, saliency detection and the segmentation characteristics of the image are useful, so we propose a method that integrates the image segmentation information obtained with the method in [1]. 3) For semantic segmentation: (1) for the unary potentials, we incorporate HOG, RGB color histogram, and LBP features using the methods in [2]; (2) the manifold structure of the image better reflects the relative importance among superpixel regions and ultimately boosts accuracy, so we add a higher-order potential term that captures the intrinsic manifold features of each superpixel region. Experiments show that the model's performance improves on all three sub-tasks.
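The abstract names unary and higher-order potentials but does not state the energy function. As a rough orientation only, a generic CRF energy over superpixel labels could be written as below. All symbols are assumptions introduced for illustration, not the authors' notation: $y_i$ is the label of superpixel $i$, $\mathbf{x}$ the image, $V$ the set of superpixels, $C$ a set of higher-order cliques (regions), and $\psi_i$, $\psi_c$ the unary and higher-order potentials. Any pairwise terms and the coupling with the scene classification and object detection variables are omitted because the abstract does not describe them.

```latex
% Generic sketch of a CRF energy with unary and higher-order terms.
% Symbols are illustrative assumptions, not taken from the paper.
E(\mathbf{y} \mid \mathbf{x})
  = \sum_{i \in V} \psi_i(y_i; \mathbf{x})
  + \sum_{c \in C} \psi_c(\mathbf{y}_c; \mathbf{x}),
\qquad
\mathbf{y}^{*} = \arg\min_{\mathbf{y}} E(\mathbf{y} \mid \mathbf{x})
```

In this reading, the HOG, RGB histogram, and LBP features would enter through $\psi_i$, and the manifold-based term through $\psi_c$.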
Similar resources
Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding
Recent trends in image understanding have pushed for holistic scene understanding models that jointly reason about various tasks such as object detection, scene recognition, shape analysis, contextual reasoning, and local appearance based classifiers. In this work, we are interested in understanding the roles of these different tasks in improved scene understanding, in particular semantic segme...
Fusion Based Holistic Road Scene Understanding
This paper addresses the problem of holistic road scene understanding based on the integration of visual and range data. To achieve the grand goal, we propose an approach that jointly tackles object-level image segmentation and semantic region labeling within a conditional random field (CRF) framework. Specifically, we first generate semantic object hypotheses by clustering 3D points, learning ...
Understanding Text in Scene Images
With the rapid growth of camera-based mobile devices, applications that answer questions such as “What does this sign say?” are becoming increasingly popular. This is related to the problem of optical character recognition (OCR), where the task is to recognize text occurring in images. The OCR problem has a long history in the computer vision community. However, the success of OCR systems is la...
Robust Fuzzy Content Based Regularization Technique in Super Resolution Imaging
Super-resolution (SR) aims to overcome the ill-posed conditions of image acquisition. SR facilitates scene recognition from low-resolution image(s). It is generally assumed that high- and low-resolution images share similar intrinsic geometries. Various approaches have tried to aggregate the informative details of multiple low-resolution images into a high-resolution one. In this paper, we present a n...
Efficient Structured Prediction with Latent Variables for General Graphical Models
In this paper we propose a unified framework for structured prediction with latent variables which includes hidden conditional random fields and latent structured support vector machines as special cases. We describe a local entropy approximation for this general formulation using duality, and derive an efficient message passing algorithm that is guaranteed to converge. We demonstrate its effec...
Publication date: 2016